Wireless telepresence collaboration system

ABSTRACT

A method and system providing wireless personal telepresence facilitating collaboration by two or more persons, each in different locations, on a task at one of the locations and requiring visualization by multiple persons. A portable wireless unit captures/transmits video depicting the technician's first-hand field of view while keeping her hands free to perform the task. The expert employs a management console for visualization of the task being performed by the technician while communicating in real time with the technician. The management console also provides control over video and audio functions including record, playback, freeze frame and image attributes. Communication between the two persons is accomplished by exchanging digitally compressed video and audio via a voice or data network, either public or private. A community server augments the system by supporting three or more participants and enabling communication across public networks.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is the non-provisional of provisional patent application No. 60/336,014, titled "Wireless endpoint for videoconferencing," filed Dec. 5, 2001.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

[0002] Not applicable

BACKGROUND OF THE INVENTION

[0003] 1.1 This invention relates generally to the field of computer supported collaborative work between plural users, and more particularly to a system for capturing and transmitting, via wireless means, images and sound of the user's immediate surroundings to, as well as receiving images and audio from, one or more other parties via a network connection in order to achieve a basic form of telepresence.

[0004] 1.2 Collaborative work often requires one or more parties to physically travel to a common location where the work is to be done. For example, an expert technician from headquarters will travel to a plant location to assist the local technician in troubleshooting and correcting an equipment problem. Said travel is time consuming and expensive, and can result in substantial delays in remedying a manufacturing issue requiring specialized technical expertise. In addition to technical field service, other examples where a visual means of collaborating on a task might be preferred over travel include security operations, home healthcare, emergency services, real-estate sales, various types of training and building inspections.

[0005] 1.3 While the telephone is a very useful means of communicating over long distances, the absence of a real-time visual component limits its effectiveness in situations where verbal description is inadequate. Videoconferencing has evolved to include dedicated as well as desktop (PC-based) systems that allow two or more parties to “see” each other via images transmitted across a computer network. Typically, each party sits in front of a camera used to capture the image to be sent and a monitor to display the image being received. A microphone and speaker assembly perform the same functions for sound. Supplemental images are often captured using additional cameras, typically on a fixed copy stand assembly. Images generated using a personal computer or video recording device can also be transmitted and received. None of these solutions provides a real-time first-person perspective or the ease of mobility afforded by the invention.

[0006] 1.4 Recent advances in cellular telephony have seen the integration of low-resolution cameras into cellular handsets. Due to the limited bandwidth of the cellular wireless network and the limited processing power of the handset, these phone/camera combinations offer only low-resolution still images that take several minutes to send.

[0007] 1.5 Wearable computing has evolved to offer a wide array of application-specific solutions for stock keeping, point of sale and meter reading, to name a few. No prior art has yet combined the required user interface, processing power, compression technology and ergonomics to provide the unique personal telepresence solution described herein.

[0008] 1.6 Prior art in the related fields of the invention includes the following patents:

4,845,636   July 1989       Walker           364/479
4,847,894   July 1989       Chanvin et al.   379/104
4,965,819   October 1990    Kannes           379/53
5,010,399   April 1991      Goodman et al.   358/85
5,164,979   November 1992   Choi             379/40
5,202,759   April 1993      Laycock          358/108
5,382,943   January 1995    Tanaka           348/143
6,307,526   October 2001    Mann             345/8

BRIEF SUMMARY OF THE INVENTION

[0009] 2.1 In accordance with a preferred embodiment of the invention, a wireless endpoint for personal telepresence comprises a small, lightweight, portable wireless unit (camera, display, microphone, speaker/earphone and wireless transceiver) of size, weight and shape to allow it to be worn by the user or otherwise affixed and positioned to capture video of the user's immediate surroundings, receive video images from one or more parties, as in a videoconference, as well as provide for exchange of two-way audio. Said video and audio streams are relayed to a computer network via a presence server. Said presence server provides the necessary media and format conversion of the video and audio to make it compatible with the network and other endpoints in the communication. Return video and audio from other parties, as in a videoconference, is received by the portable wireless unit where it can be viewed on the display and output through a speaker/earphone.

[0010] 2.2 The purpose of the present invention is to provide an apparatus for exchanging video and audio across a computer network to allow two (or more) people at different locations to visualize the same first-person perspective in real time. This allows collaboration on tasks requiring visual cues without the need for either party to travel or otherwise spend time getting to the location where the task is to be performed. The image quality must be sufficient to allow a person viewing the images of the task to have approximately the same viewpoint as the person performing the task. Accomplishing this requires a specialized compression technique devised to provide clear images from a camera that may be head mounted and constantly moving, thereby causing the entire image field to change, instead of just a portion of it as would be the case with conventional videoconferencing.

[0011] 2.3 Other objects and advantages of the present invention will become apparent from the following descriptions, taken in connection with the accompanying drawings, wherein, by way of illustration and example, embodiments of the present invention are disclosed.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

[0012] 3.1 Drawing sheet 1 depicts a perspective view of an analog wireless telepresence collaboration system in the context of a computer network.

[0013] 3.2 Drawing sheet 2 is a schematic depicting the main components of an analog wireless telepresence collaboration system.

[0014] 3.3 Drawing sheet 3 depicts a perspective view of a digital wireless telepresence collaboration system in the context of a computer network.

[0015] 3.4 Drawing sheet 4 is a schematic depicting the main components of a digital wireless telepresence collaboration system.

[0016] 3.5 Drawing sheet 5 is a schematic depicting the main components of a wireless telepresence collaboration system including a community server.

[0017] 3.6 Drawing sheet 6 depicts a preferred embodiment of the portable wireless unit.

[0018] 3.7 Drawing sheet 7 depicts three views of a preferred embodiment of a headset assembly comprised of a miniature video camera, microphone and earphone.

DETAILED DESCRIPTION OF THE INVENTION

[0019] 4.1 Detailed descriptions of the preferred embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for teaching one skilled in the art to employ the present invention in virtually any appropriately detailed system, structure or manner.

[0020] 4.2.1 Drawing sheet 1 depicts a perspective view of an analog wireless telepresence collaboration system in the context of a network. Numerical references in the following refer to the numbered items in the figure. NOTE: Power supply to the various components of the invention is assumed and not shown for clarity.

[0021] 4.2.2 In a preferred embodiment of the invention, the user of the portable wireless unit (25) is a technician at a REMOTE LOCATION who needs the assistance of a colleague (at HOME BASE) on a task to be performed. The miniature video camera (2) is positioned by the user, either by wearing it on their person or affixing it to another object, so as to capture images of the TASK (1) being performed. The display (17) shows the image being captured by the camera. Microphone (3) and earphone/speaker (24) allow two-way voice communication with other parties, as in a videoconference. Camera, display, microphone and earphone/speaker are connected (4) to a wireless transceiver (5). In addition to a power switch (6), the wireless transceiver has a capture start/stop control (7), a frequency selector control (8), a video input connector (18) and an audio connector (19). Video and audio are transmitted via wireless (9) to a presence server (10). The presence server converts the wireless video and audio to a media and format (11) compatible with the network and management console (14). The presence server has a frequency selector control (22). The management console is connected to the network (13).

[0022] In a preferred embodiment, the miniature video camera (2) is comprised of the model SCM-251 by Fong Kai Industries, Richardson, Tex. fitted with a 6 mm focal length board camera lens and infrared cut filter and conforming to the NTSC video standard.

[0023] In a preferred embodiment of the invention, the microphone (3) and earphone/speaker (24) are comprised of the model KX-TCA88 hands-free headset by Panasonic Corp., Secaucus, N.J.

[0024] In a preferred embodiment, the wireless transceiver includes:

[0025] a model TM090100 video/audio transmitter by Lawmate, Taipei, Taiwan;

[0026] and a model RX09010 video/audio receiver by Lawmate, Taipei, Taiwan.

[0027] In a preferred embodiment, the presence server comprises:

[0028] a model RX09010 video/audio receiver by Lawmate, Taipei, Taiwan;

[0029] a model TMO90100 video/audio transmitter by Lawmate, Taipei, Taiwan;

[0030] a model USBAV-170 video capture cable by ADS Technologies, Cerritos, Calif.;

[0031] a video codec capable of compressing/decompressing video at a 200:1 compression ratio in real time, such as the wavelet codec by Vianet Technology, Plano, Tex.;

[0032] and a Microsoft Windows® compatible computer of at least 500 MHz processing speed, at least one USB port and a suitable network connection.
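
To give a sense of what a 200:1 compression ratio means for the network connection, the following is a rough worked example rather than a figure from the specification. The frame size, colour depth and frame rate are assumptions chosen to resemble a digitized NTSC signal; only the 200:1 ratio comes from the text.

```python
# Hypothetical figures: frame size, colour depth and frame rate are
# assumptions, not values taken from the specification. Only the 200:1
# compression ratio is stated in the text.
WIDTH, HEIGHT = 640, 480      # assumed digitized NTSC frame, in pixels
BITS_PER_PIXEL = 24           # assumed 8 bits per colour channel
FRAMES_PER_SECOND = 30        # nominal NTSC frame rate
COMPRESSION_RATIO = 200       # from the text: 200:1


def raw_bitrate_bps(width, height, bits_per_pixel, fps):
    """Bits per second of uncompressed video."""
    return width * height * bits_per_pixel * fps


def compressed_bitrate_bps(raw_bps, ratio):
    """Bits per second after compressing at ratio:1."""
    return raw_bps / ratio


raw = raw_bitrate_bps(WIDTH, HEIGHT, BITS_PER_PIXEL, FRAMES_PER_SECOND)
compressed = compressed_bitrate_bps(raw, COMPRESSION_RATIO)
print(f"raw: {raw / 1e6:.1f} Mbit/s, compressed: {compressed / 1e6:.2f} Mbit/s")
# prints "raw: 221.2 Mbit/s, compressed: 1.11 Mbit/s"
```

Under these assumptions, the compressed stream is on the order of 1 Mbit/s, which is the scale of link the presence server would need for full-rate video.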

[0033] In a preferred embodiment, the management console (14) comprises a Microsoft Windows® compatible computer of at least 500 MHz processing speed and a suitable network connection.

[0034] 4.2.3 At HOME BASE, the management console (14) is connected to the same network and displays the video image of the TASK (15) and renders the corresponding audio through a speaker or headphone (16). A camera (20) captures and sends video to be displayed on the presence server (21) and the display (17) in the field. A microphone (23) captures audio to be sent to the REMOTE LOCATION user's speaker/earphone.

[0035] 4.3.1 Drawing sheet 2 depicts a schematic of the main components of an analog wireless telepresence collaboration system. NOTE: Power supply to the various components of the invention is assumed and not shown for clarity.

[0036] 4.3.2 The output of the miniature video camera (30) is connected to a wireless video/audio transceiver (33). The output of microphone (31) is connected to wireless video/audio transceiver (33). Video and audio are transmitted (34) to a fixed transceiver (35) which is connected to a presence server (36). The presence server compresses the audio and video and reformats it to be compatible with the network. The presence server is connected to a network (37). The network serves to transport formatted video and audio to and from the presence server.

[0037] 4.3.3 Video and audio are received from the network, unformatted and decompressed at the presence server (36), where they are converted to video and audio signals and sent to a fixed wireless video/audio transceiver (35). Video and audio are transmitted wirelessly (34) to a portable wireless transceiver (33), where they are rendered on the display (32) and a speaker/earphone (41).
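
The compress-and-format path toward the network, and the unformat-and-decompress path back, can be sketched as follows. This is a minimal illustration, not the presence server's actual implementation: `zlib` is a lossless stand-in for the wavelet codec, and the five-byte header layout (media type plus payload length) is a hypothetical format invented for the example.

```python
import struct
import zlib

# Hypothetical packet header: 1-byte media type, 4-byte payload length,
# both in network byte order. The specification does not define a format.
HEADER = struct.Struct("!BI")
VIDEO, AUDIO = 0, 1


def compress_and_format(media_type: int, raw: bytes) -> bytes:
    """Outbound path: compress the media, then prepend the header."""
    payload = zlib.compress(raw)  # stand-in for the wavelet codec
    return HEADER.pack(media_type, len(payload)) + payload


def unformat_and_decompress(packet: bytes):
    """Inbound path: strip the header, then decompress the payload."""
    media_type, length = HEADER.unpack_from(packet)
    payload = packet[HEADER.size:HEADER.size + length]
    if len(payload) != length:
        raise ValueError("truncated packet")
    return media_type, zlib.decompress(payload)


frame = bytes(1024)  # a dummy all-zero "frame"
packet = compress_and_format(VIDEO, frame)
media_type, restored = unformat_and_decompress(packet)
```

The inbound function simply reverses the outbound steps in the opposite order, mirroring the two directions described above.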

[0038] 4.4.1 Drawing sheet 3 depicts a perspective view of a digital wireless telepresence collaboration system in the context of a network. Numerical references in the following refer to the numbered items in the figure. NOTE: Power supply to the various components of the invention is assumed and not shown for clarity.

[0039] 4.4.2 In another preferred embodiment of the invention, the user of the portable wireless unit (25) is a technician at a REMOTE LOCATION who needs the assistance of a colleague (at HOME BASE) on a task to be performed. The miniature video camera (2) is positioned by the user, either by wearing it on their person or affixing it to another object, so as to capture images of the TASK (1) being performed. The display (17) shows the image being captured by the camera or being received from the management console (14) at HOME BASE. Microphone (3) and earphone/speaker (24) allow two-way voice communication with other parties, as in a videoconference. Camera, display, microphone and earphone/speaker are connected (4) to a presence server/wireless transceiver (26). In addition to a power switch (6), the presence server/wireless transceiver has a capture start/stop control (7), a frequency selector control (8), a video input connector (18) and an audio connector (19). The presence server/wireless transceiver (26) converts the video and audio to a media and format compatible with the network and management console (14). The presence server/wireless transceiver also has a frequency selector control (22). The presence server/wireless transceiver is connected to the network (13) via a wireless access point (28). The presence server/wireless transceiver transmits and receives video and audio over the wireless connection (27). The management console is also connected to the network (13).

[0040] In a preferred embodiment, the miniature video camera (2) is comprised of the model SCM-251 by Fong Kai Industries, Richardson, Tex. fitted with a 6 mm focal length board camera lens and infrared cut filter.

[0041] In a preferred embodiment of the invention, the microphone (3) and earphone/speaker (24) are comprised of the model KX-TCA88 hands-free headset by Panasonic Corp., Secaucus, N.J. In a preferred embodiment, the presence server/wireless transceiver (26) includes: a handheld computer running Microsoft Windows CE®, such as the model iPaq 3950 by HP/Compaq Computer, Houston, Tex.;

[0042] a video capture accessory, such as the model FlyJacket expansion accessory and video capture system by Animation Technologies, Taipei, Taiwan;

[0043] an 802.11b-compatible wireless LAN adapter, such as model DCF-660W compact flash wireless ethernet adapter by D-Link, Irvine, Calif.;

[0044] a video codec capable of compressing/decompressing video at a 200:1 compression ratio in real time, such as the wavelet codec by Vianet Technology, Plano, Tex.

[0045] In a preferred embodiment, the management console (14) comprises a Microsoft Windows® compatible computer of at least 500 MHz processing speed and a suitable network connection.

[0046] 4.4.3 At HOME BASE, the management console (14) is connected to the same network and displays the video image of the TASK (15) and renders the corresponding audio through a speaker or headphone (16). A camera (20) captures and sends video to be displayed on the presence server (21) and the display (17) in the field. A microphone (23) captures audio to be sent to the REMOTE LOCATION user's speaker/earphone.

[0047] 4.5.1 Drawing sheet 4 is a schematic depicting the main components of a wireless (digital) endpoint for personal telepresence. NOTE: Power supply to the various components of the invention is assumed and not shown for clarity.

[0048] 4.5.2 The output of the miniature video camera (30) is connected to a portable presence server/wireless transceiver (42). The output of a microphone (31) is connected to a portable presence server/wireless transceiver (42). Video and audio signals are compressed, reformatted and transmitted in digital format (43) to a wireless access point (44) which is connected to a network (37).

[0049] 4.5.3 Video and audio from the network are transmitted by the wireless access point (44) to the presence server (42). Video and audio are unformatted and decompressed by the portable presence server/wireless transceiver (42). Video and audio are then rendered on the display (32) and a speaker/earphone (41).

[0050] 4.5 Drawing sheet 5 is a schematic depicting the major elements of a network connection between a presence server and a management console managed by a community server.

[0051] 4.6 In another preferred embodiment of the invention, exchange of video and audio between the presence server and the management console is initiated and managed by a community server connected to the same network. This addresses situations wherein either the presence server or the management console resides inside a secure firewall, thereby preventing direct communication with any parties outside said firewall. When the presence server (50) is activated, it initiates contact (61) with the community server (53), thereby establishing a two-way connection with the community server (64) for video and audio exchange. Likewise, the management console (52) initiates contact (62) with the community server and establishes a two-way connection with the community server (63) for video and audio exchange. Once both the presence server and the management console have established two-way communications with the community server, an access control function in the community server enables communication from the presence server to the management console and vice versa. This example illustrates communication between two parties, as in a videoconference. The scope of this invention is intended to include communication between three or more parties, through the addition of presence servers and/or management consoles as appropriate.
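
The relay and access control behaviour described above can be sketched as a small in-memory model. This is an illustration under stated assumptions, not the community server's implementation: the class name, method names and endpoint labels are hypothetical, and real network connections are replaced by method calls so that the ordering of events is easy to follow.

```python
from collections import defaultdict, deque


class CommunityServer:
    """In-memory stand-in for the community server's relay function.

    Endpoints always call in to the server (never the reverse), which is
    what lets the exchange work when an endpoint sits behind a firewall.
    """

    def __init__(self):
        self.connected = set()            # endpoints that have initiated contact
        self.peers = {}                   # endpoint -> endpoint it may talk to
        self.outbox = defaultdict(deque)  # endpoint -> media waiting for it

    def connect(self, endpoint: str) -> None:
        """An endpoint initiates contact with the server."""
        self.connected.add(endpoint)

    def authorize(self, a: str, b: str) -> bool:
        """Access control: pair two endpoints only once both are connected."""
        if a in self.connected and b in self.connected:
            self.peers[a] = b
            self.peers[b] = a
            return True
        return False

    def send(self, sender: str, media: bytes) -> bool:
        """Queue media for the sender's authorized peer, if it has one."""
        peer = self.peers.get(sender)
        if peer is None:
            return False
        self.outbox[peer].append(media)
        return True

    def receive(self, endpoint: str):
        """Return the next queued item for an endpoint, or None."""
        queue = self.outbox[endpoint]
        return queue.popleft() if queue else None


server = CommunityServer()
server.connect("presence-server-50")
early = server.authorize("presence-server-50", "console-52")   # console absent
server.connect("console-52")
paired = server.authorize("presence-server-50", "console-52")  # both present
server.send("presence-server-50", b"video-frame-1")
delivered = server.receive("console-52")
```

Supporting three or more parties would mean replacing the one-to-one `peers` mapping with groups, which this sketch deliberately leaves out.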

[0052] 4.7 In a preferred embodiment, the community server comprises an Intel Pentium® class computer running Microsoft Windows XP®.

[0053] 4.8 Another preferred embodiment allows for a direct, peer-to-peer, two-party communication to be established with the assistance of the community server. This addresses situations wherein direct communication is possible, but the network address of one or the other party is not known. The presence server (50) and management console (52) each initiate contact with the community server (53), and upon doing so learn the network address of each other, thereby enabling a direct connection (65) to be established between the presence server and the management console.
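
The address discovery step can be sketched as a simple directory kept by the community server. The class and method names below are hypothetical, the addresses are reserved documentation addresses, and the sketch covers only the lookup; opening the direct connection itself is outside its scope.

```python
from typing import Dict, Optional, Tuple

Address = Tuple[str, int]  # (IP address, port)


class AddressDirectory:
    """Hypothetical directory role played by the community server."""

    def __init__(self) -> None:
        self._known: Dict[str, Address] = {}

    def check_in(self, name: str, address: Address, wants: str) -> Optional[Address]:
        """Record the caller's address and return the address of the
        party it wants to reach, or None if that party has not yet
        checked in."""
        self._known[name] = address
        return self._known.get(wants)


directory = AddressDirectory()

# The presence server checks in first; the console is not yet known.
first = directory.check_in("presence-server", ("192.0.2.10", 5000), wants="console")

# The console checks in and immediately learns the presence server's address.
second = directory.check_in("console", ("198.51.100.7", 5001), wants="presence-server")

# The presence server asks again and now learns the console's address.
third = directory.check_in("presence-server", ("192.0.2.10", 5000), wants="console")
```

Once each side holds the other's address, the two can open the direct connection without further involvement from the community server.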

[0054] 4.9 Another preferred embodiment detects available bandwidth on the communication link and automatically adjusts the video frame rate and image resolution accordingly. The presence server and management console each use a similar mechanism to regularly monitor their respective communication link to determine the available bandwidth for sending video and audio. This measurement in turn is used to dynamically adjust the frame rate, image resolution or both for video being sent.
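
One way such an adjustment could work is sketched below. The specification does not say how bandwidth is measured or which settings are chosen, so the throughput estimate, the thresholds and the resolution/frame-rate ladder are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoSettings:
    width: int
    height: int
    fps: int


# Hypothetical ladder, most demanding first. Each entry pairs the minimum
# available bandwidth (kbit/s) with the settings to use at or above it.
LADDER = [
    (1000, VideoSettings(640, 480, 30)),
    (500, VideoSettings(320, 240, 30)),
    (250, VideoSettings(320, 240, 15)),
    (0, VideoSettings(160, 120, 10)),
]


def estimate_kbps(bytes_sent: int, seconds: float) -> float:
    """Estimate link throughput from a recent transfer (an assumed method)."""
    return bytes_sent * 8 / 1000 / seconds


def choose_settings(available_kbps: float) -> VideoSettings:
    """Pick the most demanding settings the measured bandwidth supports."""
    for min_kbps, settings in LADDER:
        if available_kbps >= min_kbps:
            return settings
    return LADDER[-1][1]  # fall back to the lowest rung


measured = estimate_kbps(bytes_sent=125_000, seconds=1.0)
current = choose_settings(measured)
```

Calling `choose_settings` at a regular interval with a fresh estimate gives the dynamic behaviour described: as the link degrades, resolution drops first, then frame rate.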

[0055] 4.10 Another preferred embodiment provides a motion processing mechanism to compensate for camera movement in the video compression technique. Such a motion processing mechanism consists of analyzing the changes in video images (i.e. individual frames) and discarding frames as needed to maintain image quality and minimize image latency. Methods are well known in the prior art to reduce bandwidth usage by detecting regions of an image that differ from frame to frame. The methods include strategies to encode, transmit, detect, and decode only those regions. This preferred embodiment improves the usefulness of a wireless collaboration system by including an implementation of these methods and strategies, such as provided by the Vianet Technologies, Inc. wavelet encoder. This embodiment extends the art by also including a method to completely eliminate frames that are “too late” to be useful due to camera motion. When the camera quickly moves, the method discards frames that were generated during the rapid movement allowing communication link bandwidth, presence server, and console image processing to focus on the useful frames derived when the motion stops.
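
The frame-discarding idea can be sketched as follows. The specification does not state how rapid camera movement is detected, so this example assumes a simple measure, the mean absolute pixel difference between consecutive frames, and a hypothetical threshold. Frames are modelled as flat lists of grayscale values.

```python
from typing import List, Sequence

Frame = Sequence[int]  # flat list of grayscale pixel values


def motion_score(prev: Frame, cur: Frame) -> float:
    """Mean absolute pixel difference between two frames (assumed measure)."""
    return sum(abs(a - b) for a, b in zip(prev, cur)) / len(cur)


def drop_motion_frames(frames: List[Frame], threshold: float) -> List[Frame]:
    """Keep only frames captured while the camera is roughly still.

    Each frame is compared with the frame captured just before it,
    whether or not that earlier frame was kept.
    """
    kept = []
    prev = None
    for frame in frames:
        if prev is None or motion_score(prev, frame) <= threshold:
            kept.append(frame)
        prev = frame
    return kept


frames = [
    [10, 10, 10, 10],      # still
    [11, 10, 10, 10],      # still
    [200, 200, 200, 200],  # camera swings: whole image changes
    [90, 90, 90, 90],      # still swinging
    [90, 91, 90, 90],      # motion has stopped
]
kept = drop_motion_frames(frames, threshold=20.0)
```

With these inputs the two frames captured during the swing are discarded, so bandwidth and processing go to the frames before the movement and the first stable frame after it.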

[0056] 4.11 Another preferred embodiment provides for encryption of video and audio at its source (either the presence server or the management console), using encryption such as the hierarchical encryption scheme offered by Asier Technology, Plano, Tex. In such an embodiment, video and audio are first compressed and formatted, then processed by said encryption scheme before being transmitted across any communication link. Conversely, encrypted video and audio, once received, are first decrypted, then decompressed for display and playback.
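
The ordering of these steps can be sketched as below. The cipher here is a toy XOR keystream built from SHA-256, used only so the example runs with the standard library; it is not secure, and it is not the hierarchical scheme named above. `zlib` again stands in for the video codec, and all function names are hypothetical.

```python
import hashlib
import zlib


def _keystream(key: bytes, length: int) -> bytes:
    """Toy keystream: SHA-256 of key plus a counter. Illustration only."""
    out = bytearray()
    counter = 0
    while len(out) < length:
        out += hashlib.sha256(key + counter.to_bytes(8, "big")).digest()
        counter += 1
    return bytes(out[:length])


def toy_cipher(key: bytes, data: bytes) -> bytes:
    """XOR data with the keystream. Applying it twice restores the input.
    NOT secure; a placeholder for a real encryption scheme."""
    return bytes(a ^ b for a, b in zip(data, _keystream(key, len(data))))


def prepare_for_link(key: bytes, raw: bytes) -> bytes:
    """Sender side: compress first, then encrypt."""
    return toy_cipher(key, zlib.compress(raw))


def recover_from_link(key: bytes, wire: bytes) -> bytes:
    """Receiver side: decrypt first, then decompress."""
    return zlib.decompress(toy_cipher(key, wire))


key = b"shared-secret"          # hypothetical pre-shared key
media = b"frame data " * 100
wire = prepare_for_link(key, media)
restored = recover_from_link(key, wire)
```

Compressing before encrypting matters because encrypted data looks random and no longer compresses well, so reversing the order would forfeit most of the bandwidth savings.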

[0057] 4.12 Drawing sheet 6 depicts a preferred embodiment of the portable wireless unit comprising a wireless transceiver and headset assembly. The headset assembly comprising a miniature video camera, microphone, earphone and headband is connected to the wireless transceiver via a cable harness. Various controls and the display are disposed on the front of the wireless transceiver.

[0058] 4.13 Drawing sheet 7 depicts a preferred embodiment of the headset assembly in three views. FIG. 1 depicts a perspective view. FIG. 2 depicts an elevation view. FIG. 3 depicts a profile view.

[0059] 4.14 While the invention has been described in connection with preferred embodiments, these are not intended to limit the scope of the invention to the particular form set forth, but on the contrary, are intended to cover such alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. 

We claim:
 1. A system, comprising a portable wireless unit disposed to be used by a technician at a remote location, said portable wireless unit comprising a miniature video camera, display, microphone, speaker/earphone, wireless transceiver and controls; and a presence server disposed to relay video and audio between the portable wireless unit and said management console, said presence server comprising video and audio compression, video and audio formatting, a communication interface to said portable wireless unit, and a communication interface to said communication link; and a management console disposed to be used by an expert at home base, said management console comprising software; and a communication link disposed to support data communications between the presence server and said management console, said communication link comprising a public or private, narrowband or broadband data connection; wherein said portable wireless unit is disposed to capture, transmit, receive and display video and audio; wherein said portable wireless unit is of size and weight so as to allow it to be easily carried, worn or temporarily affixed to another object by said technician; wherein said miniature video camera is of size and weight so as to allow it to be worn in a manner that captures said technician's first-person perspective (e.g. on the forehead, on the shoulder, adjacent to or in front of an eye) while keeping said technician's hands free; wherein said display is disposed to allow said technician to visualize images from said camera or said alternate video source, said display is disposed to allow said technician to visualize images being received from said management console, as in a videoconference; wherein said microphone is disposed to allow said technician's voice and other sounds to be captured; wherein said speaker/earphone is disposed to allow said technician to hear audio being received from said management console, as in a videoconference; wherein said wireless transceiver is disposed to relay video and audio from said miniature video camera and said microphone to said presence server, relay video and audio from said presence server to said display and said speaker/earphone, accept input of alternate video and audio from a source other than said miniature video camera and said microphone; wherein said controls are disposed to allow said technician to change the wireless radio frequency being used by said portable wireless unit to transmit and receive video and audio, said controls are disposed to allow said technician to select and adjust the video images being displayed on said display, said controls are disposed to effect start, stop, pause, record, and playback of the video and audio being captured or transmitted; wherein said presence server is disposed to compress, decompress and format video and audio, record and playback video and audio; and wherein said management console is disposed to receive and display video from said presence server, send video images to said presence server, compress, decompress and format video and audio, record and playback video and audio, exchange two-way audio with said presence server, and exchange control messages with said presence server, said management console comprising software; and whereby said expert, using said management console, may view images of said task being performed by said technician, communicate with said technician and send video images to said technician.
 2. A system as in claim 1, wherein a community server is disposed to provide connection management, audio pooling, video retransmission and archival functions for one or more presence servers and management consoles, said community server comprising software disposed to run on a general purpose computer connected to said communications link.
 3. A system as in claim 1, comprising a camera that is of size and weight disposed to allow its use in spaces too small to enter or otherwise out of reach of said technician.
 4. A system as in claim 1, comprising connection management in said presence server disposed to detect loss and resumption of the wireless connection, and resume communications when said wireless connection is reestablished.
 5. A system as in claim 1, comprising auto-record/playback in said presence server disposed to automatically record video/audio upon loss of said wireless connection, and replay previously recorded video, audio and still images.
 6. A system as in claim 1, comprising multiple video support disposed to enable two or more video streams to be sent by said presence server to said management console, wherein said management console is disposed to display multiple video streams, whereby said expert is able to view multiple images of said task simultaneously.
 7. A system as in claim 1, comprising bandwidth management in the presence server disposed to automatically detect available bandwidth across said communication link and automatically adjust frame rate & picture quality accordingly.
 8. A system as in claim 1, comprising motion processing disposed to compensate for camera movement and maintain video image quality while minimizing image latency.
 9. A system as in claim 1, comprising encryption disposed to secure said video and audio whereby said video and audio streams are encrypted across said communication link.
 10. A method, comprising operating said portable wireless unit; operating said miniature video camera; adjusting said miniature video camera; adjusting said microphone; operating said earphone/speaker; selecting the source of the video to be viewed on said display; adjusting said display; controlling communication start, stop, pause, record, playback functions; compressing said video and audio for transmission across said communication link; decompressing said video and audio for use on said display and said earphone/speaker; formatting said video and audio for transmission across said network; establishing a connection between said presence server and said management console, as in a videoconference; transmitting and receiving said video and audio via wireless means; adjusting the image size, frame rate and picture quality of said video being transmitted; whereby said technician and said expert may collaborate on said task, wherein said task requires visualization of said task by both parties.
 11. A method as in claim 10, comprising connecting said presence server and said management console to said community server; pooling of said audio by said community server; retransmission of said video by said community server; archival and retrieval of said video and audio by said community server whereby multiple presence servers and management consoles can exchange video and audio.
 12. A method as in claim 10, comprising detection of loss and resumption of the wireless connection whilst maintaining said communication between said presence server and said management console.
 13. A method as in claim 10, comprising automatic recording of video/audio upon loss of said connection and playback of said recorded video/audio.
 14. A method as in claim 10, comprising transmission, display and control of multiple video streams from said presence server to said management console.
 15. A method as in claim 10, comprising automatic detection of available bandwidth across said communication link and corresponding automatic adjustment of said video frame rate and picture quality.
 16. A method as in claim 10, comprising measurement of motion in said video field of view and corresponding automatic adjustment of said video frame rate and picture quality.
 17. A method, as in claim 10, comprising encryption of said video and audio across said communication link. 